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CYCLE STRUCTURE OF RANDOM PERMUTATIONS 
WITH CYCLE WEIGHTS 
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Abstract. We investigate the typical cycle lengths, the total number of cycles, and the 
number of finite cycles in random permutations whose probability involves cycle weights. 
Typical cycle lengths and total number of cycles depend strongly on the parameters, 
while the distributions of finite cycles are usually independent Poisson random variables. 
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1. Introduction 

Weighted random partitions and random permutations appear in mathematical biology 
and in theoretical physics. They are appealing because of their natural probabilistic 
structure and their combinatorial flavor. The sample space for random permutations is 
the set Sn of permutations of n elements. Given a G 5„, we let Rj{cr) denote the number 
of cycles of length j in a. The probability of a permutation a is then defined as 

^'(-) = i n (1^1) 

Here, 6i,92, ■ ■ ■ are nonnegative parameters and /i„ is the normalization that makes P„ a 
probability distribution, i.e., 

cr&Sn j ^ 1 
1 
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Notice that Ri,R2, ■ ■ ■ satisfy the following identity for all cr G 5^: 

n 

Y,jRM) = ^- (1-3) 

The case 9j = 1 corresponds to random permutations with uniform distribution and it 
has been studied e.g. in [Til El HH]- The case 6j = 6 is known as the Ewens distribution 
and it was introduced for the study of population dynamics in mathematical biology [12] . 
See [21 [T5l IT3] and references therein. Random permutations with restriction on the cycle 
lengths can be described with parameters 9j G {0,1}. Results have been obtained for 
random permutations restricted to finite cycles [181 E] or to cycle lengths of given parity 
|16j . Another situation of interest is when we fix the asymptotic behavior of 9j for large 
j. Such a setting was considered in [1], and it also appears in the study of the quantum 
Bose gas in statistical mechanics [3 [8]. The case where 6j converges to a constant (i.e., 
the Ewens case, asymptotically) was considered in [161 1^. Vanishing parameters 6j — )• 
were studied in [H [9] where logarithmic cycle lengths were observed. 

The random variables to be discussed in this article are the following: 

• Li(a) gives the length of the cycle that contains the index 1. Since our probability 
distribution is invariant under relabeling, we can interpret Li as giving the length 
of a "typical cycle", i.e. the length of the cycle that contains a random index. 

• K{a) = Yl]=i Rj{^) gives the total number of cycles in the permutation a. 

• We also consider the distributions of Ri,R2, ... as n — )• oo, i.e., the number of 
finite cycles. In most cases, they will be seen to converge to independent Poisson 
random variables. 

The probability distribution P„ depends only on the cycle structure {Ri{a), . . . , Rn{cr)) 
of a G Sn- It can therefore be understood as a distribution on nonnegative integers 
(ri,...,rn) that satisfy ^jrj = n. As is well-known, these numbers are in one-to-one 
correspondence with integer partitions (Ai, A2, . . . ) with Ai ^ A2 ^ ... and ^ Aj = n, 
by defining rj = #{i : Aj = j}. Taking into account the number of partitions that 
correspond to a given set (ri, . . . ,r„), the probability of a weighted random partition is 
then 

P.(A..A....)^in±(|)-. ,1.4) 

A random variable such as Li cannot be expressed in terms of {Rj}, but one can give 
an interpretation of its distribution in the context of random partitions. Given a random 
partition, pick i £ {1, . . . , n} uniformly at random, and take j that satisfies 

Ai + • • • + Xj^i <i ^ Ai + • • • + Aj-. (1.5) 

The distribution of Xj is identical to that of Li. 

Let us discuss the heuristics behind the dependence of cycle lengths on parameters. It 
is tempting to think that, if 9j is increasing, longer cycles are favored. But this turns out 
to be incorrect. For instance, typical cycle lengths are of order n for 9j = 1; they are 
of order (logn)^/"^ for 9j = e^"' with < 7 < 1; they are again of order n for 9j = . 
The reason for this apparently erratic behavior is best understood from the perspective of 
statistical mechanics, as already mentioned in [9j. We can assign to each index a weight 
that depends on the length of the cycle it belongs to. Let Li{a) denote the length of the 
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cycle that contains i, and notice the identity 

n n 

= ^3ajRj{(y), (1.6) 

i=i j=i 

that holds for any a and any numbers ai , 02 , . . . . Choosing Oj = i log 9j , the probability 
distribution P„ can be rewritten in the form of a "Gibbs state" , namely 

1 '"' 

1=1 

The weight o-Liiu) plays the role of the negative of the energy. The heuristics become 

• If = ilog^ j is increasing, indices prefer to be in longer cycles, and typical cycle 
lengths are longer. 

• If Oj = i log dj is decreasing, the converse happens. 



Expression (1.7) also points to an important symmetry of the parameters: Adding a 
constant c to each aj does not affect P^- But it amounts to changing the parameters from 
9j to 9j e^^ . This also shows that purely exponential parameters are equivalent to uniform 
random permutations. 

We have summarized the results about weighted random permutations in Table [l} Pre- 
cise claims can be found in Sections [3}|8j The behavior of Li is strikingly similar when the 
parameters grow sub-exponentially or decay super-exponentially. Notice that typical cycle 
lengths were obtained earlier in the asymptotic Ewens case |16|[9] and when 9j decreases to 
[9j. We complement these results with statements about finite cycles and total number 
of cycles (see also [1]). We also show that the joint distribution of Li, L2, . . . converges to 
the Poisson-Dirichlet distribution. 

2. Generalities 

2.1. Generating functions and basic expressions for random variables. Let us 

start with the exponential generating function of weighted cyclic permutations. There are 
(n — 1)! cyclic permutations of n elements, and therefore 



n\ ^ — ' n 

n ^ 1 crgcSn n ^ 1 

cyclic 

A general permutation can be viewed as a combinatorial set of cycles, so that the generating 
function of permutations is given by the exponential of the generating function of cyclic 
permutations (see Corollary 6.6 of [Ij). We set ho = 1. Then 

G,iz) = Kz- = E ^ E n <^ = E (2-2) 

n ^ n > ' o-e5„ j > 1 n ^ 1 

We now obtain expressions that characterize the random variables Li, K, Rj. For part (c), 
we use r[fc] to denote the descending factorial, 

r[fc] =r(r-l)...(r-A; + l). (2.3) 

Proposition 2.1. 
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(b) E„,(K)=x;%^. 

" h 

(c) E„, {Rj)[k.]^ = ^ ^ n (^) ^ •^^'^ '^^^ integers A;i, . . . , fc„ suc/i i/iai jfcj ^ 

n. 

Before proving this proposition, let us mention two useful consequences. Summing over 
all possible values for j in the identity (a), we get a relation for the hnS, namely 

K = ^^ejK-j. (2.4) 

" i=i 

The following corollary will apply to all regimes of parameters that we consider, except 
super-exponential growth or decay. 

Corollary 2.2. When hn-i/hn — )• 1 as n — )• oo, the joint distribution of the num- 
ber of finite cycles, Ri, R2, R3, ■ ■ ■ , converges weakly to independent Poisson with means 

1/1, 2 5 3 5 • • • • 

Proof. We have from Proposition |2.1| (c) that 

JimE.(n(i?.)fe])=n(7)'^' (2-5) 

for all ki,k2,. . ■ with finitely many nonzero terms. The result is then standard, see e.g. 
Lemma 2.8 of [H. □ 



Proof of Proposition \2. 1\ The sum over permutations with Li = j can be done by first 
summing over the (j — 1) other indices that belong to the cycle that contains 1 (there are 
(n — 1) . . . (n — J + 1) possibilities), then by summing over permutations of the remaining 
(n — j) indices. We get 

P„(Li = j) = ^(n - 1) . . . (n - j + l)%(n - j)!/i„_, = (2.6) 

We now prove the identity (c). We use the generating function Gfi{s). Let /ci, /c2, . . . be 
nonnegative integers such that jkj ^ n. Recall that hn and Gh depend on 61,62, .. ■ 
Using hn = [s"']Gh{s), we have 



. jhr, 
1 . / TT d^^ 

s 



hn 



(2.7) 



As a special case of (c) we have KniRj) = which yields the expression (b) for 

the expected number of cycles. □ 
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2.2. Saddle point analysis. Several asymptotic results will be derived using the method 
of steepest descent, which prompts us to introduce it here. We refer the reader unfamiliar 
with this method to [17J for appropriate background. For our particular application we 
will require a uniform extension of this method to a family of descent problems essentially 
indexed by n. This extension may more generally be referred to as saddle point analysis. 
What this entails and how it is justified will be discussed in the final section. We state 
here the main result based on this method and how it will apply to the cases of interest 
in this paper. More details can be found in Section |9j 

Let G{z) be a function that is analytic at the origin with Taylor series there having a 
finite, non-zero radius of convergence (which we will take to be 1 in all cases). For r > 0, 
let 

a{r) =r {log G{r)y, 
p[r) = ra [rj 



and assume that 



lim a(r) = oo; lim /3(r) = oo. 

r— >1 r— ^-l 



Assume further that for n sufficiently large there is an e such that on (1 — e, 1) there is a 
unique solution to the equation 

a(r) = n (2.9) 

and denote this root by r„. Notice that in this range Tfi is increasing and lini^_).QQ 
Then the main saddle point result we use states that 

nG(z) = ^^^^(1 + 0(1)). (2.10) 

We will apply this in two cases: (i) for the generating function Gg of the parameters; 
(ii) for the generating function Gh of the coefficients hn- 
It is convenient to introduce 

I^{Z) = J2 ^^^nZ"". (2.11) 
n ^ 1 

These functions satisfy the following recursion relations 

I,{z) = zl'^_,iz). (2.12) 
In the case of the parameter generating functions we take 

Ge{z) = z-Ho{z) = ^I-i{z). (2.13) 

az 



One easily finds that 



Define p„ by the equation a{pn) = n. Then, applying (2.10), the parameters are asymp- 
totically equal to 

On+i= f' ^P'^^ — il + oil)). (2.15) 
2 ■s/2-Ka'{pn) 
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Let US turn to the second case, i.e., the generating function of hn given by (2.2). We 
have the relations 

Gh{z) = expl_i(z), 

a(r) = /o(r), Io{rn) = n, (2.16) 
/3(r)=/i(r). 

Notice that r„ is increasing in n and that hmr„ is equal to the radius of convergence of 
Jo- The coefficients /i„ are asymptotically equal to 

g/-l(r„) 

K = —====(l + o{l)). (2.17) 

We will actually deal with ratios of those numbers, and the following bounds will prove 
extremely useful. 

Proposition 2.3. Assume that 6i,92,..- are such that the saddle point approximation 
involving Gh is valid. Then 



Proof. The asymptotic approximation ( 2.17| ) for hn implies that 



where A(n, j) can be written as 

A{n,j) = I-i{rn-j) - h{rn~j) logr„_j - /-i(r„) + Io{rn) logr^. (2.19) 



By the fundamental theorem of calculus, using (2.12) 



A{n,j)= — (/_i(n)-/o(n)log^x)dn= / >^ ^ Au. (2.20) 

We can bound logu ^ logr^j—j. The integral of I\{u)/u yields Io{rn) — Io{^n—j) — J &rid 
we get the lower bound of the proposition. The upper bound is similar, using logn ^ 
logr^. □ 

3. Parameters with super-exponential growth 

First we consider the regime when 9n diverges fast enough. It is not too hard to 
show that only one cycle of length n is present, meaning also that all finite cycles have 
disappeared. 

Theorem 3.1. Assume that On > for all n, and that 

lini y = 0. 

Then 



n-^oo — un 



lim P„(Li =n) = l. 
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It immediately follows that P„(ivr = 1) — ^ 1 and Pn(-Rj = 0) — ^ 1 for all fixed j. 

Let us check that the theorem applies to the parameters 9n = e" with 7 > 1. We have 

= e-""[i-(^)"-(i-^)"] . (3.1) 



■'n 



It is easy to check that (1 - s)"^ ^ 1 - cs for ^ s ^ | with c = 2(1 - 2"'^) > 1. Then 
forj ^ f, 

^ e-""[^n-{i)"l ^ e-(^-i)"""'^ (3.2) 

It follows that 

^^fe ^ 25^^^^ ^ 2 J] e-(^-i)"^"^ (3.3) 



which clearly goes to as n —t- 00. 



Proof of theorem 3.1 . By the assumption of the theorem, there exists N such that 6jdn-j < 



On for all n N . Let C such that /i„ ^ COn for all n ^ N. We now prove by induction 



that this upper bound holds for all n. By (2.4) and the induction hypothesis, 

Q n+l 

hn+l ^ — — T %^n+l-j ^ (3.4) 



We have Pn(ivi = n) = by Proposition 2.1 (a). Using \2A\ , we get 



Using hn-j ^ C9n-j and the assumption of the theorem, we see that ^ ^ 1 as 
n — )• cxD. □ 

4. Parameters with sub-exponential growth 

Our goal here is to understand the regime of parameters that grow sub-exponentially, 
e" with < 7 < 1. It turns out to be difficult to tackle this case directly and we 
appeal to an indirect approach, by focusing on the generating function rather than its 
parameters. 

Let A, a, b, c be positive parameters to be chosen later, and let 

Ge{z) = A{1 - e»(i-^)"' . (4.1) 

Then the parameters, are the coefficients of /q or, equivalently, the shifted coefficients 
of Ge: 

0„ = [z"-i]Ge(z). (4.2) 

It is not hard to check (by repeated differentiation) that On > ^ for all n ^ 1. Notice 
also that the radius of convergence of Gq is 1. 

Proposition 4.1. 

A(a6)6TT(|-^) 1 , 6+2 N 

— ^ ■nb+i'^ 2 I 



7n+l 



1 / 1 

-t;; , , , „ , , , „ , , 

V0\ 



L(aO)°+l 2 ' 1 , 6+2 -, r I- b_, b ,-, 2 -, , b-1 ^ 

^refc+i'- 2 exp<^ Ia(6+l)(a6) t+i I ni'+i + 1 i(a6) f'+i I n^+i +o(n''+i 
i/27r(6 + l) 
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Proof. We use the saddle point method, which has long been used for this class of functions 
EOl. We have 



a{z) = z-^logGe{z) = z 
az 



+ 



ah 



l-z ■ {l-zf+^ 
h{z) = h{z) {a{z) + 1) = Io{z) [abz{l - z)-^-^ + cz(l - z)-^ + l] 
Notice that, as z — )■ 1, 

a'{z) = ab{b + 1)(1 - + o(l)). 

Defining p„ by a(p„) = n we obtain 



^(1 



By the implicit function theorem, we get 

Pn = 1 - (^)'^ + - + o(n-(^^^)) 

We compute the terms that appear in the formula ( |2.15[ ) for On- First, 



(4.3) 

(4.4) 
(4.5) 

(4.6) 



Ge{Pn) = A{^)>'^^ exp{a(^) 



exp 



|(o6) b+l 



b , . , 6-1 

6+1 _j_ / n_\ t+1 
^ b+1 KabJ 

b-l 



b+1 



+ on 6+1 



6+1 n 6+1 + i|^(a6)6+in6+i + 5^ + o( 



6-1 
nfc+i 



vo 



a{pn) = (6 + l)(a6) fli^S? (1 + o(l)). 



We get the proposition by inserting these values into (2.15). 

We choose the numbers A, a, 6, c so that On ~ e"^ . Precisely, let 

6 



(4.7) 



□ 



7 

1-7' 



a 
c ■ 



(1-7)71-^, 



^ + 1 
2 ' ^' 



(4.8) 



A = V2vr(6+ l)(a6)"^(^"'^\ 



They imply the following relations, that are often useful when checking the details of the 
calculations: 

7=bh, ab = ^^, 5^1 = 1-7- (4.9) 
With these numbers, the precise asymptotic expression of On is 



On = expjn^ + 172^2^-1 + o(n(27-i)vO)|. 



(4.10) 



Notice that On = e"^ (1 + o(l)) when 7 < ^. The case 7 = 2 '^^^ be handled by modifying 
the number A. For the case 7 > the correction ■n?'^~'^ is present in the exponential and 
it cannot be removed easily. 

It is time to state the main result of this section. 



Theorem 4.2. Consider the set of parameters 01,02, 

b, c i 

(1-7)-^/^. 



whose generating function is 



given by Eq. (4.1) with A,a,b,c specialized as in Eq. (4.8). Then 



(a) 



(log n 



1I/7 



10 



NICHOLAS M. ERCOLANI, DANIEL UELTSCHI 



(b) Ri,R2, - ■ ■ converge weakly to independent Poisson random variables with respec- 
tive means 9i,^, . . . . 



Proof. Let = (1 — 7) . We show that for any e > 0, 

Li 



hm 



(log n) 1/7 



B 



By Proposition 2.1 (a), we have 

Li 



(logn)i/7 



B 



> e 



> e 



E 



0. 



Ojhn- 

nhr,. 



(4.11) 



(4.12) 



I w B\>£ 

We use the saddle point method for the generating function Gh = e^-^ . The equation 
loifn) = n implies that 

-n = l-[Mog^^]-^/^ (4.13) 

(We keep using A, a, 6, c rather than 7 for convenience.) By the implicit function theorem, 
we get 

r„ = l-ai(logn)-Uo((g^) 

(4 14) 

= exp{-a^(logn)-i + 0[^g^ V (logn)-f )}. 
It follows that /i(r„) = n{l \ognf/'i{l + o(l)). We use Proposition 2.3 to get 

hn-3 



y5^(l^fc))'^''exp{-,ai(logn)-Hl + 0(l^V(^ (4.15) 

The cases j = n and j = n — 1 need actually to be handled separately. Using the expression 
(4.10) for 6j and the bound above, it is easy to check that 



hr, 



lim > — ■- 

n^oo ^-^ nhn 
j=n/2 



0. 



(4.16) 



For 1 ^ j ^ n/2, we have 

^j^n—j 



E 



(logn)^/ ' 



nhr. 



^ C expjj''' — jab (log n) 6— logn 



+ 0(/^^-^n + 0(^V^^)}. (4.17) 
Let us make the change of variables j = i(logn)^/'^. Then 



E 



(log n)^/ t 



9h 

nhr, 



c E 



■ logn[ab i-i-y +l+o{l)] 



(4.18) 



je{Iogn)-i/7p 
|j--B|>e 



It is easy to see that the function f{x) = abx — x"' + 1 is convex with a minimum at 
B = {1 — "y)~^^"', where it takes value 0. For |x — i?| > e we can bound f{x) > 6\x — B\. 
We can estimate the sum by an integral, in order to get 



E 



(logn) ' ' 



nhn 



< C(logn 



,1/7 



\x-B\>e 



g-(logn)5|x-i?| (4 -^g) 



CYCLE STRUCTURE OF RANDOM PERMUTATIONS WITH CYCLE WEIGHTS 



11 



which clearly vanishes in the limit n — )■ oo. This proves (a). 

It is clear from Proposition 2.3 and Eq. (4.14) that — )• 1 as n — )• oo, so that 



(b) follows immediately from Corollary 2.2 □ 



5. Parameters with algebraic growth 



We again work with a generating function rather than parameters. Recall that 7 > 0, 
and let 

r(7 + i) 
(i-z)- 

One easily checks that 



W= n-.mi -r(7 + l)- (5.1) 



d" r(n + 7 + l) 



One then gets the parameters: 



e„ = [.»]/„(.) = ii^/„(o) = E(l±^. (5.3) 

n! dz" n! 

By a straightforward application of Stirling's formula one sees that the parameters grow 
algebraically: 

^„ = n^(l + o(l)). (5.4) 



Theorem 5.1. Choose 9i, 62, ■ ■ ■ such that their generating function is given by Eq. (5.1 ). 

Then 

1 

(a) Li/n^+i converges weakly to the Gamma random variable with parameters (7+I, a) 

1 

with a = r(7 + 1)^+1 . In other words, we have 

lim P„f— ^ <s)= r x^e-'^'^dx. 

(b) lim n~^En{K) = (T{-/)/-f^)^' . 

n— >oo 

(c) The distribution of number of finite cycles converges weakly to independent Poisson 
random variables with means Oi,^, . . . . 

Proof. We use the saddle point method. Let r„ be defined by loirn) = n. Then 

rn = l-(r(f^ + l)~^. (5.5) 
It is enough for our purpose to retain 

1_ 2+7 1_ 2_ 

r„ = 1 — an i+T+0(n i+^)=exp|— an i+t + 0(n ^+t)|. (5.6) 



2.3 



In order to use Proposition 

on that j < Cn^^^^'^"'^ for a constant C independent of n. A few calculations yield 



we check that • is close to r^,. We assume from now 



r: 



■■' — exp{— ajn i+t + 0(j^n ^+1 V jn ^+^)}. (5.7) 



2+7 2 



Then 

r^, =r^(l + o(l)). (5.8) 

We also have 

/i(z) = r(7 + 2)z(l-z)-^-2. (5.9) 
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n 



h{rn) _ Jjn 
h{rn-j) Tn-j^n-j 



2+7 
1+7 



e^'^"^^' (l + o(l)) 



(5.10) 



It follows that 



h{rn) 



y h{rn-j! 



(5.11) 



We can now proceed to the calculation of the distribution of Li. Using Proposition 2.1 
(a), we have 



ll/{l+7) 



nV(i+7) 



< s 



-1/(1+7) 



n 



;i + o(i)). 



(5.12) 



(We can use the asymptotic value for 6j because finite j contribute a vanishing amount. 
We rescale the variables in order to recognize a Riemann integral: 



,1/(1 + 7) 



ni/(i+7) 



< s 



(5.13) 



ni+7 



As n — 7- oo, this converges to the probability that the Gamma random variable with 
parameters (7 + l,a) be less than s. 

For part (b) we use Proposition 2.1 (b). Using 6j = p{l + o(l)) and Eq. (5.11), we 
have 

n 

E„(i^) = ^j^-ie~«^"'''^^'' (l + o(l))- (5.14) 
i=i 

Notice that the contribution of finite j vanishes, which justifies using the asymptotic 
expression for 6j. Introducing the appropriate scaling that leads to a Riemann integral, 
we rewrite the expression as 



7 



■E„(A') 



-E(^)''"<'-°'""*(i+<'(i))- 

+ 1 , 1 7+1 ^ 



717 



The right side converges to ^ e ^ da; = r(7) and we obtain the claim (b) 



Part (c) follows from (5.11) and Corollary 2.2 



(5.15) 



□ 



6. Asymptotic Ewens parameters 

Past studies of the Ewens distribution have focused on the number of cycles. It was 
shown in particular that the number of cycles with length less than is approximately 
equal to ^slogn for all < s ^ 1, and that it satisfies a central limit theorem jl5j 
and a large deviation principle [13]. In this section we consider the case where 6j — )• as 
j ^ 00. We look at the distribution of finite cycles and at the joint distribution of the 
largest cycles. Let L^^\L^'^\ . . . denote the cycle lengths in nonincreasing order (for all 
a e Snwe have ^jjRjicr) = L^*) = n). 
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The large cycle lengths converge to the Poisson-Dirichlet distribution. In order to de- 
fine it, first consider a sequence of i.i.d. beta random variables with parameters (1,0), 
{Xi,X2, . . . ). That is, F{X > s) = (1 — s)^ for ^ s ^ 1. Then form the sequence 
{Xi, (1 — Xi)X2, (1 — -'^i)(l — X2)X3, . . .). It is not hard to check that it is a random 
partition of [0, 1], which is called the Griffiths-Engen-McCloskey distribution. Reorganiz- 
ing these numbers in nonincreasing order gives another random partition of [0, 1], and the 
corresponding distribution is called Poisson-Dirichlet. 

Theorem 6.1. Assume that On — t- 6. Then, as n ^ oo, 

(a) the random variables Ri, R2, R3, ■ ■ ■ converge weakly to independent Poisson with 



respective means 9i,^,^, . . . ; 
(b) the total number of cycles is logarithmic: lim 



Z/i) 
ra ' n ' 



En(jr) 

log n 



converges weakly to Poisson-Dirichlet with 



(c) the joint distribution of 
parameter 9. 

The last result involves only the limit 9 and not the individual parameters 9jS. This is 
not surprising as the longest cycles become infinite as n — t- 00. The theorems of |15| I13 | 
also concern cycles of diverging lengths and they should remain valid in the asymptotic 
Ewens case without modifications. On the other hand, the distribution of finite cycles 
depends explicitly on the 9jS. 

The rest of this section is devoted to the proof of this theorem. It relies on estimates 
for the normalization Let us introduce the function A(a:), x ^ 1, by 

' )=-pE— (6-1) 
where ^ s < 1. 



A 



1 



Lemma 6.2. The function A is "slowly varying" in a strong sense. Namely, let (x„) and 
{Un) be any two diverging sequences such that there exists a constant C > 1 with 

1 



C 



Xf 
Vn 



< c 



for all n. Then 



Proof. We need to show that 



A(x, 



hm 

n->-oo A[y„ 



E 

j > 1 



(6.2) 



converges to as n — t- cxd. Given e > 0, let A^^ such that \9j — 9\ < e for all j > N^. The 



sum over the first terms of ( |6.2[ ) goes to as n 

1 



= £ 


log — 




Vn 



00. The rest is less than 
^ elogC. 



The expression (6.2) is then as small as we want when n is large enough. 



(6.3) 
□ 



Using the definition (6.1) and recognizing the Taylor series of the logarithm, we have 

1 



G,(.) = (l-.)-U(— ). 



(6.4) 
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Proposition 6.3. 



n 



m 



k{n) (l + o(l)). 



Proof. The generating function of /i„ being given by (6.4) with A a slowly varying function, 
we can use the Tauberian theorem of Hardy-Littlewood-Karamata (see Theorem 9 of |6]) 
to obtain 

n-l fl_i 

(6.5) 



We need to remove the Cesaro average in the left side. From (2.4), we have 

„ n— 1 n— 1 

hn = -Y.^j + ^Y.^Gn-j-e)hj. (6.6) 



n 



j=0 



j=0 



The first term of the right side can be combined with (6.5) and it gives the right result. 
We need to check that the correction due to the second term is irrelevant, i.e., we need to 
check that 



1™ A / 



e)hj = 0. 



(6.7) 



j=0 



Let e > 0. Using (6.5), we first have 

(l-£)n 



n«A(n) ^ '"""^ 



\hj ^ ( sup \9j 

^ en 



((l-e)n)^A((l-e)n) 



n' 



^A{n)r{e + 1) 



;i + o(l)), (6.^ 



which clearly vanishes in the limit n — )■ oo. Second, let C = supj \ 9j — 9\, and observe that 



n*^A(n) 



n-l n-l (l-e)n 

j={l-e)n j=0 j=0 

Then 

1 """^ C 



j={l-£)n 

which is arbitrarily small since e is arbitrary 
We can now prove the theorem. 



(6.10) 
□ 



Proof of Theorem 6.1 The claim (a) easily follows from Lemma 6.2, Proposition 6.3 and 
Corollary [2:21 

For the claim (b) , we first observe that the number of cycles of length larger than ^^^^^ 

is less than ylogn, so we only need to consider smaller cycles. This means that we can 
sum up to 



in the expression of Proposition 2.1 (b) for E„(i^). By Proposition 



6.3 



the ratio ^ ^ converges to 1 as n — )• 00, uniformly in 1 ^ j ^ n/yTogn. It follows that 



E (K) 1 "/^^^^ 

lim = lim ^ 



logn 



E 



log n ' J 



(6.11) 



We turn to part (c). Let Li,L2, ■ ■ . denote the lengths of the cycles when they have 
been ordered e.g. according to their smallest element. That is, Li = Li is the length of 
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the cycle that contains the index 1; L2 is the length of the cycle that contains the smallest 
index that is not in the first cycle; and so on... We show that for all k, 



( hi -^2 Lj, 

^ n n — Li n — Li — L2 n — Li 



L 



k-l 



'Li L2 



converges to i.i.d. beta random variables with parameters (1, 9). This implies that 
converges weakly to GEM(0); and reordering the cycle lengths in nonincreasing order yields 
PD{9). It is enough to show that for any k and any ai, . . . , G (0, 1), we have 



lim P„(— ^ 01,..., ^ 

n^oo \ n n — Li 



L 



k-l 



^ at) = ll[l-{l-aiY 



(6.12) 



i=l 



The right side is the beta measure of the product of intervals x^^^(0,aj). 

We proceed by induction on k, starting with k = 1. By Proposition |2.1| (a), we have 



A — ^ ai 
V n 



We used Proposition 



1 "1" L 

1 X - „ n. 



j=o " j=o 



-iA(ra-i) 
A(n) 



;i + o(l)). (6.13) 



6.3 



to get the second identity. By Lemma 



6.2 



the ratio ^1" P 

A{n) 



converges to 1. We clearly have a Riemann sum, so that 

lim fJ— ai) = 9 [ \l- xf'^dx = 1 - (1 - aif . (6.14) 

n— >oo \ n ) Jq 

Next, we assume that the claim has been proved for k and we prove it for k + 1. Let 

A=|(^i,...,4)e{l,...,ri}'=:- ^ ai,..., ^ aA. (6.15) 

L n n — u — ■ ■ ■ — ik-i J 



It is not hard to verify that, on A, 

k 

n-ii 4 ^ nJJ(l-ai). 



(6.16) 



i=l 



We have 



„ (Li,...,Lfc) e A 



{^i,...,4)eA 



n—Li —Lk 

(Li =4,...,^ = 4) 



Ofc+l 



n—Li 



Li 



(6.17) 



Now we use the self-similarity of weighted permutations: Having chosen the first k cycles, 
the distribution of the (A; + l)th cycle is identical but with less indices available. Precisely, 
we have 



n— Li 



4 



■ n-ii 



< Ofc+l 



(6.18) 

(Notice that the {k + l)th cycle in the left side has become the 1st cycle in the right side.) 
The right side of the equation converges to the beta measure of (0,0^+1). Convergence is 
uniform in (ii, . . . £ A because of (6.16). The right side of (6.17) then converges to 
the beta measure of the product of intervals x^^j^(0,at) by the induction hypothesis. □ 
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7. Parameters with sub-exponential decay 

The second regime with long cycles occurs for parameters that go slowly to 0, such as 
9n = with 7 > 0, or 0„ = e~" with < 7 < 1. It is not hard to check that the 
assumptions of the theorem below are satisfied in both these cases. Notice that the results 
about the RjS and about K have already been proved in |4j in the case On ~ n~'^ . 

Theorem 7.1. Assume that < "g^ ^ < cj for all n and all 1 ^ j ^ ^, with constants 

Cj that satisfy ^ 1 "f" < co. Assume also that — ?• 1 as n —t- 00. Then hj < 00, 
and 

lim Pn(Li = n — m) 



Ylij > 

In addition, Ri, R2, R3, . . . converge weakly to independent Poisson random variables with 
respective means 9i,^ , . . . , and K — 1 converges to Poisson with mean y. 

Proof. The claim about Li was proved in [9]. The claim about the Rjs follows from 
Corollary |2.2| and from the fact that 

= ^(1 + 0(1)) (7.1) 
n 

with C = Y^hj. This was proved in [9], see Eq. (3.12) there. 

In order to prove that K — 1 converges to a Poisson random variable, let m, k be fixed. 
We consider the set of permutations 

A = {a:Ri{a) + --- + R„,{a) = k-l}, (7.2) 

and B the set of permutations where exactly one cycle has length larger than m. We have, 
for all n > 2m, 

AnB C {a : K{a) = k}, {a : Liia) ^ n - m} C B. (7.3) 

Then 

m 

¥n{K = k) ^ P„(^)-P({Li ^ n-my)=¥n{A)-l + Y,¥n{Li=n-j). (7.4) 

We take the limit n — >• 00. Since Ri + ■ ■ ■ + Rm converges to Poisson with mean YlJLi 
we get 



We now take the limit m — t- 00 and we get 



liminfP„(iv: = A;) ^ rrfV— ) e~^^>^T. (7.6) 

n^oa (k — 1)! V ^-^^ J ) 

Summing over ^ 1, the left side is less or equal to 1 by Fatou's lemma; the right side 
yields 1. This shows that the inequality above is actually an identity, and X — 1 is indeed 
Poisson in the limit n — )• 00. □ 
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8. Parameters with super-exponential decay 



We conclude our study of random permutations with cycle weights by discussing the 
case 6n = e""'' with 7 > 1. It was actually studied in [9j, where the typical cycle length 
was proved to be a fractional power of logn, namely 

Li 



1. 



1 



((7 - l)logn)i/7 

We complement this result with a claim about the number of finite cycles. It is actually 
not very sharp, but it provides useful information nonetheless. 

Theorem 8.1. As n ^ 00, we have 



'3 J 



exp07 



/ lo£C Tl \ ~ — ~ / — ^ 

' * +0 (logn) 7 



}■ 



V7-I. 

Proof. The radius of convergence of Gg and Gh is now infinite. Let r„ satisfy Io{rn) = n. 
It was shown in [9], see Eq. (4.32) there, that 

log n ~ 

It follows that 



exp< 7 



logn 

7^ 



log n / ) 



i.2) 
U) 



We can then express Vn-j in term of r„. 



o((logn) 7 j 



where the precise meaning of o(-) is that for any e > 0, there exists such that 



o((logn) 



7-1 , 



7-1 

(log n) 7 

for all J, n such that n > N and n — j > N . 
Next, we observe that the parameters satisfy 



< e 



-0-1)^ 



(8.4) 
(8.5) 

(8.6) 



e ' = e 

so that ^ e"^-'"^^^ for all j large enough. It follows that if r is large enough, 

Ii(r) = je-^'' ^ ^ e-^^"^)^ = r(/o(r) + 1). (8.7) 

Since Ii{r) is increasing in r and r„ is increasing in n, we have for n large enough. 



1 ^ 



Iljrn) 

h{rn-j) 



Jo(rn) + 1 



Using Proposition 2.3 and Eq. (8.2), we have 

hn-j f . /- log n 

^exp<^j7' 



hr. 



7-1 



7-1 

+ o(i(logn) V^j. 



2.1 



(c) is ^n{Rj) 



h„ 



(8.9) 

Combining this with the previous 



A special case of Proposition 
equation, and neglecting 9j/ j which is less than the error, we get the claim of the theorem 
for all j finite. □ 
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9. Uniform saddle point estimates 

Since the two cases of generating functions that we have considered are entirely similar, 
we initially restrict attention to the case of the generating function for the /i„ (as specified 



in (2.16)). Gh{z) = exp/_i(2:) is analytic in the unit disc and hence the normalization 



coefficients we want to study are naturally given by the Cauchy representation 



' f G'.(z)^ (9.1) 



27ri 



where Cn is the circle centered at of radius r„. With respect to polar coordinates along 
Cn this becomes 

hn = ^ r exp f/_i (rn e'^)-n log (r e'* ) ) dc/.. (9.2) 



2vr„ 

We observe that the function 

Fn{z) = I-i{z)-n\og{z) (9.3) 

which is positive and continuous on (0, 1) approaches oo at both endpoints and therefore 
attains a minimum value at r„. This point is unique and, as we have already observed, 
explicitly given as the critical point satisfying 

= F^(z) = -/-i(z)--, (9.4) 

or equivalently 

rn = lQ\n). (9.5) 

We now consider a complex neighborhood (in z) of r^. Since Fn{z) is analytic in the 
unit disc minus the origin, one may assume that Fn is analytic on the chosen neighborhood 
of r„ and hence r„ must be a saddle point of F^ (by the maximum principle) . The integral 



(9.2) is complex- valued, so it is natural to try to apply the method of steepest descent 
[17j here. One may describe this approach in terms of a dynamical system; viz., the 
Cauchy- Riemann equations for the analytic function Fn{z) may be viewed as a gradient 
dynamical system with potential Re-F„(2). The critical point z = r„ is a fixed point of 
this system and the locus ImF(n) = cuts out the stable and unstable manifolds of this 
fixed point. The real axis is the stable manifold in all the cases we consider. The steepest 
descent curves are the components of the unstable manifold; Fn{z) = ReF„(z) decreases 
monotonically along these curves as one moves away from the fixed point r„ . The situation 
is illustrated in the left graphic of Figure [T] below which depicts these stable and unstable 
manifolds for a particular case of algebraically growing parameters. For background the 
reader is referred to section 2.3 of [T7]. We begin by Taylor expanding Fn[z), as given 



by (9.3), near r„, and applying Taylor's form of the remainder theorem to derive the 



representation 

F„(.) = (/_,(r„) - nlog(r„)) + (. - r^f + MM^37U^0 + 2MM^(, _ ,„)3 

= Fn{rn) + An[z - Tnf + Bn{z - rnf{l + o(l)). (9.6) 



CYCLE STRUCTURE OF RANDOM PERMUTATIONS WITH CYCLE WEIGHTS 



19 



where 

r~n = + o(l)) 

A, 



h{rn) 



Br, 



2rl (9.7) 
h{rn) - 3Ii(rn) + n 



With z = X + iy^ the local structure of the stable and unstable manifolds is given by the 
locus 

ImF„(z) = 2An{x - rn)y + B^ {S{x - Vnfy - y^) 

= y {2An{x - r„) + Bn (3(x - r^f - y^)) (9-8) 
= 0. 

Indeed, y = locally describes the stable manifold which we have already seen to be the 
X-axis while the remaining factor, which to leading orders has the form 

y2 = ^(x-r„), (9.9) 

locally describes a parabolic arc for the unstable manifold (steepest descent curves), con- 
sistent with the example shown in Figure [T} 

One may similarly expand the real part of F„ (first line below) and then restrict it to 
the unstable manifold (second line below), 

Re Fniz) - Fn{rn) = An [{x - r„)2 - y2] + Bn [{x - Tnf " 3(x - r„)y2] (1 + o(l)) 

= [An{x - Tnf + Bn{x - rnf) (1 + o(l)) (9.10) 



where in the second line we have used (9.9) and the fact, which will be seen below, that 
Bn dominates An as n — ?• oo. 



We next apply these observations to the contour integral (9.1). By Cauchy's Theorem 
the contour of integration, C„ may be deformed within a region of analyticity without 
affecting the value of the integral. We will deform to a contour of the form C = D^+C—D- 
where is the sub-locus of the steepest descent curve in the upper half plane starting 
at rn and terminating at a point zq, inside the unit disc, to be determined. D- is the 
conjugate reflection of -D+ in the lower half plane. C is the circular arc of radius \zo\ 
starting at zq and terminating at zq. (Note that although Cn is not equal to the steepest 
descent path, it is tangent to that path at r„.) 

We concentrate first on the integral over the steepest descent contours of C At the end 
of this section it will be shown that zq may be chosen, depending on n, so that that the 



quadratic term in (9.10) goes to infinity with n while the cubic term goes to zero, zq itself 
will tend to 1 along with r„ as n — )• oo. It is then natural to make the change of variables 
along D±: 



F{rn) - Fn 
h{rn) 



2tI 



(9.11) 

{x-rnf+o{l). 
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With this we have 
1 
27ri 



dz 



27ri 

gF„(r„) 

vri 

pi^n(r„) 



20 





era 



dx 



(9.12) 



da(l + o(l)) 



where (Tq = (xq — Tn). In the last hne the change of variables (9.11) was imple- 

mented. As already mentioned, at the end of this section it will be shown that a choice of 
can be made consistent with all prior estimates and for which (Tq — ?• oo as n — )• cxd. It 
follows that 

dz p-FnCrn) 

:(1+0(1)). (9.13) 



1 

27ri 



V27rli(r„) 



To complete the verification of (2.17), as well as the similar argument for (2.10), one 
still needs to argue that the global error coming from the integral (9.2), restricted to C, 
is asymptotically negligible in comparison to (9.13). We illustrate the situation with two 
images from the case of algebraic growth (specifically, the instance of Gh for (5.3) where 
7 = 1 with n = 100). 



Figure 1. saddle point and level curve at r„ 

As has already been described, the graphic on the left in Figure [T] shows the saddle 
point r„ with paths on the real axis ascending from the saddle and the other two curves 
descending from the saddle {steepest descent curves) into the upper and lower half planes 
respectively. This illustrates the fact, stated before, that the contour Cn is tangent to 
the steepest descent curves. The graphic on the right shows the level curve, passing 
through r„, of the real part of Fn{z). This level curve is also a locus where the magnitude 



of the integrand of (9.2) is constant. Note that this level curve is quite close to being 



circular away from a small neighborhood of r„. This property is shared by other nearby 
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level curves (for level values different than -F„(r„)). This suggests that the order of the 
magnitude of the global error is bounded by the order of the absolute value of the integrand 
of (9.2) evaluated at r^e**^" . We will take a slightly different tack here which essentially 



accomplishes the same estimate but is easier to implement. Namely, we return to C and 
observe that the value of F{z) (which equals ReF(2;) along the unstable manifolds) is 
decreasing along the unstable manifolds as one moves away from Hence, the value of 
the integrand in (9.1) at the respective endpoints zq,zq = |zo|e*'^° of C is exponentially 



smaller (in n) than its value at r„. We further observe that 



1 

27ri 



,^nl--i{z) 



C z 



n+1 



-dz 



1 1 



e 



nRe7_i(|2o|e'<^) , 



1 1 nReJ-id^ole'-^O ) /" nRe [7_i{|^ol e'"* )-/-i|^ol e'-^O )] 



1 1 

^ --. — ^e 



(9.14) 



nRe7_i(|.o|e'^0)^^_^^^^ 



TT l^ol 

where the last inequality follows from the fact that 



Re 



/_i(|zo|e^'^)-/-i(|zo|e'^") 



3 > 1 



J 



|zoP (cos(j(A)-cos(j(/<o)) ^ (9.15) 



for </) G (00, vr). It follows that the integral over C is exponentially negligible in comparison 



to (9.13). 



Finally we return to the claim made just prior to (9.11 ) that may be chosen so that, in 



(9.10), the quadratic term grows to infinity with n while the cubic term decreases. Given 



the analysis presented in sections 4 and 5 and in particular the estimates (|4.8|), (|4.14|) and 



(5.6, 5.9) it suffices to show that the order of xq 



may be chosen so that 



^i(r„)(xo 



oo 




as n — >• oo. The following table summarizes the orders in n of the relevant terms and 
presents a choice for the orders of (xq — r„) for each of the cases of generating functions 
that we consider in this paper. In each case the choice is given in terms of a weighted 
geometric mean of the growth rates for /i(r„) and l^ij'n)- (Note that in the case of Gg, 
Tn should be replaced by p^-) 





0{h{rn)) 


0{h{rn)) 


O{xo - r„) 


O{h{rn){xo-rn?) 


O{h{rn){xo-rnf) 


Gh{z), alg. 




-i+'j, 


1 + 

n 12 T+i 


1 ~l 


_ 1 ~l 

Jl 4 7 + 1 


Gg(z), sub-exp. 








715'' 




Gh{z), sub-exp. 


n{logn)~ 


n(logn)^ 


n 12 (log n) 12 -1 


( " 




V(logn)i/-' ) 





The arguments in this section follow the general strategy of Hayman's method. We refer 
the reader to Chapter VIII of [14J for a nice overview of this technique. 
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